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SUMMARY 


The  computerized  neuropsychological  evaluation  of  US  Air 
Force  pilots:  Clinical  procedures  and  data-based  decisions. 

The  neuropsychological  assessment  of  US  Air  Force  pilots 
presents  several  unique  problems  given  their  relatively  high 
cognitive  functioning.  The  United  States  Air  Force  currently  has 
a  baselining  procedure  wherein  student  pilot  candidates  undergo 
computerized  cognitive  assessment.  The  intent  of  this  assessment 
is  to  archive  pre-morbid  data  against  which  to  compare  potential 
future  post-accident  performance.  The  current  work  provides  the 
necessary  background,  clinical  methods,  and  data  in  order  to 
assess  pilots  who  have  suffered  cortical  insult  such  as  trauma, 
disease,  or  exposure  to  toxins.  Methods  are  delineated  for  those 
with  pre-morbid  testing  as  well  as  for  those  pilots  without  such 
testing. 
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The  computerized  neuropsychological  evaluation  of  US  Air 
Force  pilots:  Clinical  procedures  and  data-based  decisions. 


Background 

Aviation  is  one  of  the  most  cognitively  demanding 
occupations.  Any  decline  in  cognitive  ability  is  of  great 
concern  from  a  number  of  perspectives.  After  initial  flight 
training,  a  number  of  cognitive  insults  may  result  in  an 
occupationally  significant  cognitive  decline.  These  insults  can 
include  chronic  alcohol  abuse,  brain  trauma,  cerebrovascular 
insufficiencies,  neurodegenerative  diseases,  and  psychiatric 
disabilities  such  as  depression.  The  resultant  declines  in 
performance  may  be  temporary  or  permanent.  The  complexity  of 
aviation  jobs  and  the  unforgiving  nature  of  the  working  _ 
environment  demands  a  conservative  approach  to  an  occupational 
return  after  even  the  smallest  central  nervous  system  insult.  At 
a  minimum,  medical  and  neurological  evaluations  are  completed, 
but  in  addition,  neuropsychological  assessment  may  be  indicated. 

Regulation,  in  some  instances,  is  used  to  guard  against  the 
potentially  cognitively  impaired  pilot.  Salive  (1994) , 
consideringing  the  mandatory  retirement  of  commercial  pilots  over 
the  age  of  60,  recommended  the  development  of  research  paradigms 
and  data  which  might  be  used  in  the  debate.  He  discussed  the 
history  of  the  Federal  Aviation  Administration  (F^)  rule  as  well 
as  the  epidemiology  of  relevant  diseases  and  testing.  He 
concluded  that  very  sensitive  and  specific  tests  must  be  used  in 
the  medical  and  neuropsychological  examination  of  pilots. 

Neuropsychology  is  the  examination  of  brain-behavior 
relationships .  Clinically,  it  usually  takes  the  form  of  a 
psychologist  using  various  "tests"  to  map  the  cognitive  functions 
of  a  patient  (Vanderploeg,  1994).  These  tests  assess 
intelligence,  attention,  memory,  planning  and  processing,  and 
spatial  abilities,  as  well  as  other  dimensions. 

There  has  been  some  prior  military  work  in  the  area  of 
clinical  cognitive  assessment.  Guilmette  and  Treanor  (1986) 
describe  many  of  the  reasons  for  the  importance  of  the 
neuropsychological  assessment  of  aviators.  They  looked  at  the 
neuropsychological  performance  of  a  group  of  15  Army  aviators 
using  traditional  clinical  tests.  They  found  few  differences 
between  the  aviators  and  a  control  group.  A  potential  research 
confound,  however,  was  the  fact  that  the  control  group  was  a 
remarkably  high  functioning  group  of  15  with  IQs  of  120. 

From  a  civilian  perspective,  Banich,  Stokes,  and  Elledge 
(1989)  reviewed  the  literature  on  the  mental  status  assessment  of 
pilots  They  concluded  that  interview  methods  lack  sensitivity 
and  that  existing  clinical  tests  are  the  alternative  of  choice. 
Specifically,  they  suggested  using  many  of  the  classic  variables 
found  in  multiscale  intelligence  tests .  This  group  later 
(Stokes,  Banich,  and  Elledge,  1991)  demonstrated  that 
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computerized  neuropsychological  tests  were  superior  to  short 
mental  status  examinations.  To  demonstrate  this,  they  compared  a 
group  of  pilots  to  a  group  of  cognitively  impaired  patients . 

Currently  in  the  USAF,  the  Aeromedical  Consultation  Service 
of  Armstrong  Laboratory  at  Brooks  AFB,  TX,  is  responsible  for 
conducting  aviator  medical,  psychiatric,  and  psychological 
evaluations  whenever  a  waiver  is  required  to  continue  flying. 
Results  of  the  evaluations  are  provided  to  the  referral  source 
and  recommendations  as  to  flying  status  are  made.  Standard 
evaluations  include  intelligence,  neuropsychological, 
personality,  psychopathology,  and  neurological  assessment 
procedures.  As  such,  most  referrals  are  seen  by  Neurology, 
Psychiatry,  and  Psychology  functions. 

There  are  a  number  of  problems  in  the  neuropsychological 
assessment  of  any  patient  and  a  number  specific  to  aviators.  For 
all  patients,  it  is  often  difficult  to  infer  pre-morbid  levels  of 
functioning.  This  information,  however,  is  usually  needed  in 
order  to  better  assess  the  current  level  of  functioning  and  any 
change  from  that  prior  level.  From  a  testing  perspective, 
assessment  instruments  must  be  reliable  and  valid  as  well  as 
sensitive  and  specific.  All  too  often  tests  of  limited 
psychometric  value  are  used.  Finally,  the  relationship  between 
the  testing  behavior  and  actual  real-life  functioning  is  at  best 
theoretical . 

For  aviators  specifically,  methodological  difficulties 
result  from  the  fact  that  this  group  possesses  atypically  high 
levels  of  cognitive  ability.  This  often  limits  the  use  of 
certain  statistics  such  as  variance,  reliability,  and  validity 
that  are  gleaned  from  much  more  heterogeneous  samples.  Aviators 
perform  so  well  on  tests  that  many  assumptions  that  are  used  to 
detect  change  in  patients  in  general  are  of  limited  value  when 
applied  to  aviators. 

In  1994,  the  USAF  began  a  program  to  screen  pilot  training 
candidates  prior  to  their  transfer  to  Undergraduate  Pilot 
Training  bases.  The  Enhanced  Flight  Screening  (EFS)  program 
included  actual  flight  training  in  propeller-driven  aircraft  and 
medical  evaluation.  As  a  part  of  the  medical  evaluation,  a 
number  of  psychological  tests  were  mandated.  The  primary  purpose 
of  the  cognitive  tests  is  to  archive  the  individual  pilot's 
scores  for  future  use.  The  intent  is  to  develop  a  registry 
against  which  future  testing  might  be  compared.  In  essence,  pre- 
morbid  data  was  to  be  collected  on  all  pilots  so  that  later 
medical  decisions  would  have  an  empirical  base.  As  such,  the 
psychological  portion  of  the  EFS  program  (King  and  Flynn,  1995) 
includes  traditional  scales  of  intelligence  as  well  as  newer 
computerized  cognitive  tasks. 

Purpose 

The  purpose  of  the  present  paper  is  to  provide  clinical 
procedures  for  the  evaluation  of  pilots  with  cognitive  referral 
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questions  and  to  provide  the  necessary  comparative 
Procedures  are  provided  for  patients  who  have  pre-morbid  EFS 
testing  and  for  those  without  such  testing. 

METHOD 


Subjects 

A  sample  of  537  Air  Force  pilot  training  candidates 
participated  in  this  study.  The  sample  as  a  whole  had  a  mean  age 
of  23.5  (standard  deviation  4.2)  and  about  8%  were  female. 
Subjects  who  had  been  commissioned  through  Officer  Training 
School,  Reserve  Officer  Training  Corps,  and  the  Air  National 
Guard  were  all  college  graduates.  Approximately,  42«  were 
Juniors  at  the  United  States  Air  Force  Academy.  Student  pilot 
candidates  participated  in  the  baseline  cognitive  testing  during 
EFS  either  at  the  Air  Force  Academy  in  Colorado  Springs,  CO,  or 
at  Hondo ,  TX . 


Measures 

The  Multidimensional  Aptitude  Battery  (MAB)  (Jackson,  1985) 
is  a  broad  based  test  of  intellectual  ability.  It  was  patterned 
after  the  Wechsler  Adult  Intelligence  Scale  (WAIS-R;  correlation 
=  91)  the  most  widely  used  individually  administered  test  of 

intelligence.  While  the  WAIS-R  requires  about  an  hour  and  a  half 
per  subject  to  administer,  the  MAB  can  be  given  to  groups  and 
requires  about  the  same  amount  of  total  testing _ time . 
Additionally,  the  WAIS-R  requires  skillful  scoring  while  the  MAB 
has  a  multiple  choice  format.  All  subtests  in  the  WAIS-R  have 
corresponding  paper  and  pencil  subtests  in  the  MAB  except 
immediate  digit  memory.  Verbal  components  tapped  include 
information,  comprehension,  arithmetic,  similarities,  and 
vocabulary.  Performance  measures  include  digit  symbol  coding, 
picture  completion,  spatial,  picture  arrangement,  and  object 
assembly.  Scores  on  each  of  the  subtests  are  scaled  to  a  mean  of 
50  and  a  standard  deviation  of  10.  Verbal  and  performance  sub 
scores  are  available  as  is  a  full  scale  intelligence  score,  each 
scaled  to  a  mean  of  100  and  a  standard  deviation  of  15. 
Reliabilities  for  the  summary  scores  range  from  .94  to  .98. 

Current  testing  in  the  USAF  Enhanced  Flight  Screening 
program  (King  and  Flynn,  1995),  other  US  Air  Force  research 
programs  (Flynn,  Sipes,  Grosenbach,  and  Ellsworth,  1994;  Retzlaf 
and  Gibertini,  1987),  NASA's  astronaut  selection  procedure,  and  a 
number  of  civilian  airline  screening  procedures  include  the  MAB. 

The  version  of  the  MAB  used  in  the  current  study  was 
primarily  the  computerized  version  (Retzlaff,  King,  and 
Canister  1995a)  .  Here  verbal  questions  are  presented  as  text 
on  a  computer  screen  and  subjects  are  asked  to  respond  to  t  e 
computer  with  an  a,  b,  c,  d,  ore  keyboard  entry.  The 
performance  items  were  scanned  into  computer  graphic  files  a 
Sre  presented  in  a  window  on  the  monitor.  This  computerization 
was  done  and  is  used  with  the  consent  of  the  test  author  with 
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explicit  copyright  permission.  It  is  important  to  note  that  the 
1990  norms  for  the  MAE  were  used  for  this  study.  These  norms  are 
used  in  the  computer  scoring  software  from  the  publisher . 

Earlier  work  with  the  test  or  other  current  paper-and-pencil  type 
administrations  use  the  original  1985  norms.  Hence,  direct 
comparison  with  data  such  as  Retzlaff  and  Gibertini's  (1988)  may 
be  difficult. 

The  CogScreen-Aeromedical  Edition  (Kay,  1995)  is  a  test  of 
cognitive  ability  intended  for  use  in  the  assessment  of  pilots. 
While  the  MAB  is  a  test  of  relatively  complex,  higher  order 
intellectual  processes,  the  CogScreen  tasks  are  generally  more 
fundamental  processes  such  as  reaction  time.  It  is  not  a  test  of 
aviation  knowledge  but  considered  to  include  abilities  necessary 
in  the  performance  of  aviation  duties  (Kay  and  Horst,  1988) . 

There  are  11  tasks  which  result  in  65  scores.  The  tasks  include 
Backward  Digit  Span  (BDS) ,  Math  (MATH) ,  Visual  Sequence 
Comparison  (VSC) ,  Symbol  Digit  Coding  (SDC) ,  Matching- to -Sample 
(MTS) ,  Manikin  (MAN) ,  Divided  Attention  (DAT) ,  Auditory  Sequence 
Comparison  (ASC) ,  Pathfinder  (PF) ,  Shifting  Attention  (SAT) ,  and 
Dual  Task  (DTT) .  Each  of  the  tasks  is  usually  scored  in  a  number 
of  ways.  Typical  scorings  include  task  speed,  accuracy,  and 
throughput.  Throughput  is  a  function  of  speed  and  accuracy, 
basically  the  number  of  correct  responses  per  minute.  It  is 
indicative  of  the  amount  of  work  accomplished.  A  number  of  tasks 
also  include  process  completion  measures  which  quantify  task 
specific  behavior  such  as  control  of  the  computer  screen 
elements.  The  manual  and  other  research  refers  to  the  CogScreen 
scores  by  a  relatively  cryptic  variable  naming  process.  These 
variable  names  are  defined  in  Appendix  A. 

The  CogScreen  is  relatively  new  and  represents  an  attempt  by 
its  authors  to  produce  an  assessment  device  which  met  a  number  of 
Federal  Aviation  Administration  requirements.  It  is  currently 
used  in  the  EPS  program  by  the  USAF,  by  the  US  Navy,  and  by  a 
number  of  commercial  airlines.  It  is  published  and  available 
from  one  of  the  major  psychological  test  publishers. 

The  CogScreen  was  used  as  provided  by  the  test  publisher. 
Computer  software  administers  the  test,  times  the  tasks,  scores 
the  tests,  and  archives  the  data  in  report  form. 

Clinical  methods 

There  are  three  major  manners  in  which  to  use  the  available 
data  (Retzlaff  and  Gibertini,  1994) .  The  first  is  the  intended 
purpose  of  EPS.  This  procedure  compares  the  archived  data  (pre- 
morbid)  to  later  testing  (post-morbid) ,  presumably  after  some 
sort  of  cognitive  insult. 

The  other  two  procedures  acknowledge  the  fact  that  not  all 
pilots  will  have  archived  pre-morbid  data.  This  may  be  the  case 
because  either  they  became  pilots  before  the  program  began  or 
they  become  pilots  after  the  program  was  terminated  (if  indeed 
the  program  is  terminated) .  These  two  procedures  use  data 
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developed  from  those  taking  the  EFS  testing.  As  such,  the  second 
procedure  looks  at  the  relative  ability  level  of  the 
given  the  known  ability  levels  for  the  tested  group.  The  third 
and  final  method  uses  a  number  of  the  tests  for  a  new  subject 
control  conditions  for  other  tests  taken  at  the  same  time. 

in  Performance  Method  The  first  method  is  a  pre-test, 
post -test  paradigm! It  is  the  most  reliable  but  requires  prior, 
pre-morbid  testing  data  against  which  to  compare  later  testing. 

In  the  general  clinical  case,  a  patient  may  have  prior 
intelligence  and  neuropsychological  testing,  been  exposed  to 
cortical  insult,  and  then  re-tested.  An  example  might  be  a 
patient  in  the  Veteran's  Administration  system.  It  would  be 
Common  for  a  patient  to  have  a  prior  intelligence  test  such  as  a 
WAIS-R  somewhere  in  the  system,  have  some  sort  of  cortical  insult 
such  as  a  stroke  or  head  injury,  and  then  be  re -tested  on  the 
same  intelligence  test.  Here  the  results  of  the  first  testing 
can  be  used  as  a  reference  for  the  second  testing.  A  significan 
decrement  across  testings  would  establish  the  existence  of  a 
dementia  and  gauge  the  general  severity  of  it . 

The  degree  to  which  test  scores  may  vary  from  one  testing  to 
the  next  can  be  established  statistically.  "Normal"  or  chance 
degrees  of  differences  can  be  established  through  studying  the 
stability  of  normal  subjects  across  two  testing  periods.  The 
first  testing  is  correlated  with  the  second  to  establish  a 
stability  (reliability)  coefficient.  This  coefficient  can  be 
used  to  determine  a  confidence  band  around  a  score.  Performa 
beyond  this  confidence  band  would  suggest  performance  decrements 
beyond  what  might  be  expected  by  chance. 

For  aviators  who  have  participated  in  the  EFS  program,  pre- 
morbid  data  is  available  and  can  be  retrieved  from  Armstrong 
Laboratory.  Knowing  the  aviator's  initial  perfomance,  the 
stability  coefficient  of  the  test,  and  the  variability  of  the 
test  for  aviators,  confidence  bands  can  be  established  for 
an  individual  aviator.  Performance  below  what  can  be  expected 
statistically  on  the  MAB  or  CogScreen  may  be  taken  as  evidence  of 
an  impairment . 

of  Performance  Method  To  date,  only  a  very  small 
percentage  of  USAF  aviators  have  archived  EFS  testing.  As  such, 
methodologies  are  necessary  for  the  assessment  of 
without  pre-morbid  testing.  Here  the  EFS  data  onl^B  and 
CogScreen  variables  may  be  used  as  a  group  reference, 
with  poor  performance  on  testing  following  some  insult  may  be 
inferred  to  be  at  that  low  level  of  performancedue  to  the 
cortical  insult.  Aviators  who  are  found  to  be  in  the  bottom  on 
percent  following  some  trauma,  for  example,  are  statistically 
more  likely  to  be  at  that  level  due  to  the  trauma  than  due  to 
tSeL  initial  perfomance.  In  other  words,  there  would  only  be  a 
one  percent  chance  that  the  aviator  was  pre-morbidly  at  that  low 
level  of  perfomance. 
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In  order  to  effectively  utilize  this  approach,  a  number  of 
statistics  and  tables  are  necessary.  First,  the  tneans  and 
standard  deviations  of  a  large  sample  of  fairly  similar 
individuals  is  required.  This  provides  the  norm  against  which  to 
compare  a  new  individual's  scores.  In  addition  to  these 
statistics,  percentile  levels  of  various  scores  are  often  of  use. 
While  the  mean  and  standard  deviations  model  the  underlying 
distribution  of  test  scores  when  the  distribution  is  normal,  they 
do  not  model  skewed  distributions  well  when  there  is  an 
asymmetry  in  scores .  Providing  the  scores  of  a  distribution  at 
critical  percentile  points  allows  the  scores  of  new  patients  to 
be  very  accurately  placed  relative  to  their  peers . 

Pattern  of  Performance  Method  While  the  above  method  uses  a 
large  group  of  subjects  as  the  comparison  for  an  individual  s 
post-insult  scores,  it  is  also  possible  to  use  some  elements  of 
the  person' s  own  performance  to  make  conclusion  regarding 
cognitive  change.  A  common  approach  uses  the  effects  of  aging  on 
various  types  of  test  performance  as  a  model .  It  has  long  been 
known  that  some  types  of  intellectual  ability  are  fairly 
sensitive  to  aging  and  other  types  are  quite  resistant  to  change . 
Classically,  these  are  referred  to  as  "hold"  and  "don't  hold" 
variables.  Scores  on  tasks  such  as  vocabulary  and  general 
information  generally  are  similar  across  age  brackets.  These 
tasks  tend  to  "hold"  as  one  ages.  Scores  on  other  tasks  such  as 
performance  type  tests  like  speed  dependent  visuomotor  ability 
usually  drop  off  with  age.  Here,  somewhere  in  the  fifth  decade 
of  life,  performances  "don't  hold"  and  begin  a  fairly  constant 
decline . 

Applying  this  method  to  younger  patients  who  have  had  some 
type  of  cortical  insult  suggests  that  larger  differences  in 
scores  between  "hold"  and  "don't  hold"  tests  is  associated  with 
greater  levels  of  impairment.  It  is  common,  for  example,  to  look 
at  the  difference  between  the  Vocabulary  subtest  on  the  WAIS-R 
and  the  Digit  Symbol  subtest.  If  the  Digit  Symbol  subtest  is 
more  than  2  or  3  standard  scores  below  the  Vocabulary  subtest 
score  (and  there  is  history  of  insult) ,  there  is  a  good 
likelihood  of  impairment. 

There  are  always  naturally  occurring  differences  between  two 
sub- tests  on  any  test.  It  is,  therefore,  necessary  to  quantify 
this  natural  difference  so  that  referred  aviators  might  be 
compared  to  the  "normal"  differences.  Aviators  whose  difference 
scores  between  two  tests  are  in  the  top  99%  of  non- impaired 
aviators  can  be  assumed  to  have  that  level  of  difference  due  to 
insult,  as  the  a  priori  chance  of  that  difference  is  quite  low. 

Results  and  Application 
Change  in  Performance  Method 

Table  1  provides  the  means  and  standard  deviations  for  each 
of  the  MAB  scores.  These  include  summary  scores  as  well  as 
scaled  and  raw  scores.  The  scaled  scores  are  based  upon  the  1990 
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norms.  The  raw  scores  are  provided  here  and  in  subsequent  tables 
in  the  event  that  there  is  a  re-norming  of  the  test.  As  can  be 
seen,  pilots  are  on  average  quite  intelligent  with  Full  Scale  IQ 
scores  of  119.  This  table  also  includes  the  stability  ^ 
coefficient,  the  standard  error  of  measurement,  andthe  95^ 
confidence  band  for  each  of  the  scores.  The  stability 
coefficient  is  based  upon  the  testing  and  retesting  of  a  group  of 
subjects  during  the  development  of  the  test.  It  indicates  the 
degree  to  which  scores  remain  constant  across  time.  The  standard 
error  of  estimate  statistic  indicates  the  variability  of  scores 
that  could  be  expected  from  multiple  testings  of  the  same  person. 
Finally,  the  95%  confidence  band  indicates  the  differences  in 
scores  that  might  be  expected  at  the  95%  probability  level.  This 
final  confidence  band  can  be  applied  to  any  individual's  scores. 
If  a  second  testing  is  below  the  confidence  band,  the  performance 
should  be  interpreted  as  lower  and  more  deficient  than  what  can 
be  expected  simply  due  to  measurement  error. 


As  an  example,  suppose  a  pilot  received  a  Full  Scale  IQ 
score  of  125  during  initial  EFS  screening.  The  pilot  is  then 
involved  in  a  car  accident  with  a  brief  coma.  The  pilot  is 
referred  for  follow-up  cognitive  testing.  The  expected  range  of 
scores  for  this  pilot  would  be  125  plus  or  minus  2.38  points.  As 
such,  the  range  would  be  123  to  127.  The  MAE  is  re  administere 
and  the  Full  Scale  IQ  score  is  118 .  Since  this  is  well  below  the 
bottom  of  the  confidence  band  (123) ,  there  is  good  reason  to 
suspect  a  true  decrement  in  ability.  Obviously,  it  is  another 
question  whether  an  IQ  of  118  is  too  low  to  continue  flying, 
nevertheless,  an  impairment  is  verified.  Further  testing  and 
other  evidence  can  assess  the  question  of  continued  flying. 


There  are  ten  subscales  which  can  also  be  used  to  answer 
more  specific  functional  questions  in  the  same  manner.  It  is 
of  particular  importance  when  a  referral  question  specifically 
mentions  an  error  of  concern  such  as  spatial  ability  and 
subsequent  testing  indicates  performance  on  the  spatial  subtest 
well  below  the  confidence  band.  Additional  evidence  might  be 
aathered  from  the  number  of  subscales  below  the  bands.  A  pilot 
with  only  one  of  the  tests  below  the  band  is  very  different  from 
a  pilot  with  all  ten  subtests  below  the  bands. 

With  65  variables,  the  CogScreen  is  somewhat  difficult  to 
interpret  (See  Appendix  A  for  variable  names) .  In  order  to 
better  understand  the  data,  it  is  presented  not  by  subtest  but  by 
type  of  score.  As  such,  speed  variables  are  presented  first, 
followed  by  accuracy,  throughput,  and  process  variables. 

Table  2  provides  not  only  the  means  and  standard  deviations 
for  the  CogScreen  speed  variables  but  also  the  stability 
coefficient,  the  standard  error  of  estimate,  and  the  95-s 
confidence  band.  The  stability  coefficient  was  taken  from  the 
test  manual  and  used  specifically  to  develop  the  other  two 
statistics  for  this  sample. 
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Here,  for  example,  a  subject's  reaction  time  speed  score  on 
the  Math  task  would  have  to  be  banded  by  plus  and  minus  8.26 
seconds.  As  such,  a  subject  with  a  pre-morbid  score  of  30.00 
seconds  would  have  a  95%  statistical  probability  of  producing  a 
score  between  21.74  and  38.26  seconds.  A  clinically  important 
finding  would  be  a  score  significantly  slower  such  as  42  seconds. 
In  this  example,  a  pilot  with  a  pre-morbid  score  of  30  seconds 
probably  has  a  decline  from  prior  functioning  with  that  score  of 
42  seconds.  Conversely,  a  post-morbid  score  of  35  seconds  is 
within  the  measurement  error  range  and  should  not  be  clinically 
interpreted  as  a  decline. 

With  so  many  speed  scores,  it  is  important  not  to  calculate 
so  many  statistics  on  a  single  patient  that  the  method  becomes  a 
"fishing  trip"  with  a  "drift  net" .  The  two  CogScreen  tasks  with 
the  best  speed  characteristics  are  probably  the  MTS  (Matching  to 
Sample)  and  MAN  (Manikin).  These  tasks  require  a  small  amount  of 
cognitive  performance  directed  toward  a  fairly  focal  stimuli. 

With  average  performance  in  the  one  and  a  half  to  two  second 
range,  there  is  sufficient  room  for  variable  performance.  Tasks 
which  have  much  shorter  reaction  times  are  probably  prone  to  be 
confounded  by  the  use  of  the  light  pen,  the  use  of  large  muscle 
groups,  subtle  shifts  of  position,  administration  differences, 
and  software  changes.  Tasks  such  as  MATH  are  not  true  reaction 
times.  The  30  seconds  or  so  of  task  time  includes  attention, 
reading  speed,  math  calculation  time,  and  reaction _ time .  As 
such,  it  is  a  heterogeneous  task,  and  hence  of  limited 
interpretive  value  here . 

Tables  3,  4,  and  5  provide  the  means  and  standard  deviations 
for  the  CogScreen  accuracy,  throughput,  and  process  variables. 

The  accuracy  scores  have  so  little  variance  in  normal  pilots  that 
the  calculation  of  stability  coefficients,  standard  errors  of 
measurement/  estimate,  and  confidence  bands  is  inappropriate. 

This  lack  of  variance  is  also  noted  in  the  manual  for  the 
normative  sample.  The  reason  that  the  scores  vary  so  little  is 
due  to  "ceiling  effect".  The  tasks  are  so  easy  that  most 
subjects  (at  times  over  90%)  get  all  tasks  correct  and  as  such 
there  is  no  separation  of  performance  on  the  high  end  of  ability . 
Since  throughtput  variables  are  the  product  of  speed  and  accuracy 
variables,  they  add  little  information  over  the  speed  data. 
Finally,  the  manual  does  not  present  stability  data  for  the 
process  variables  and  as  such  confidence  bands  cannot  be 
calculated.  Here  is  an  example  of  where  a  USAF  stability  study 
would  allow  for  such  data. 

Level  of  Performance  Method 

Table  6  provides  the  percentile  levels  for  the  MAE  variable^ 
distributions.  A  subject  with  a  score  of  129  would  be  at  the  95% 
and  be  quite  intelligent  compared  to  other  pilots.  For  clinical 
purposes  with  a  patient  who  did  not  have  prior  testing,  these 
data  can  be  interpreted  as  the  probability  of  a  post -insult 
decrement  in  functioning. 
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The  chances  that  a  pilot  has  a  Full  Scale  IQ  score  of  100  is 
about  1%,  because  only  1%  of  the  sample  have  Full  Scale 
IQ  scores  of  100  or  less.  One  way  to  interpret  this  data  ^ 
clinically  is  to  say  that  there  is  a  99%  chance  that  the  pilot 
with  the  IQ  of  100  had  an  IQ  of  greater  than  100  prior  to  any 
cognitive  insult.  Here  the  very  fact  of  exceptionally  low 
performance  is  in  and  of  itself  unlikely  and  most  probably  due  to 
clinical  factors. 

In  general,  scores  in  the  lower  1%  and  5%  levels  are 
probably  clinically  relevant.  Again,  the  quality  (which  scales) 
and  quantity  (how  many  scales)  are  of  interest.  Performance 
scores  and  tasks  are  more  important  for  aviators  and  also  more 
prone  to  cognitive  decline  with  insult.  Conversely,  pilots  with 
scores  in  the  top  95%  and  99%  are  probably  able  to  return  to  duty 
and  clinical  significant  impact  is  highly  unlikely. 

Tables  7,  8,  9,  and  10  provide  similar  cutscores  for  the 
CogScreen  speed,  accuracy,  throughput,  and  process  distributions. 
For  the  speed  data  in  Table  7,  performance  is  in  seconds  and 
therefore  larger  numbers  represent  poorer  performance.  While  on 
the  MAB  higher  scores  are  better,  here  lower  scores  are  better. 
Very  fast  answering  of  the  Math  items  might  result  in  a  score  of 
15  seconds.  This  would  place  that  subject  at  the  5%  level,  a 
very  good  performance. 

A  patient,  however,  who  spends  45  seconds  on  average  would 
be  somewhere  between  the  95%  and  99%  level .  That  patient  had  a 
very  small  chance  of  taking  that  much  time  given  the  group  norms 
and  so  is  probably  impaired.  Again,  the  quality  and  quantity  of 
scores  must  be  part  of  the  clinical  decision  process. 

Again,  as  with  the  speed  variables  in  the  CogScreen,  it  is 
recommended  that  the  Matching  to  Sample  (MTS)  and  Manikin  (MAN) 
tasks  be  used  for  most  clinical  work.  They  exhibit  good  range 
across  the  sample  and  are  less  prone  to  error  than  the  faster, 
pure  reaction  time  tasks. 

Table  8  provides  the  tail  of  the  distribution  associated 
with  low  accuracy  scores .  Full  tables  are  not  possible  due  to 
the  limited  variance  of  these  scores.  In  essence,  most  pilots 
got  these  tasks  right  with  a  few  pilots  getting  some  tasks  wrong. 
Using  MATH  as  the  example  again,  a  pilot  who  only  gets  a  .20 
proportion  of  the  MATH  questions _ correct  is  at  the  bottom  1%  of 
the  distribution.  A  .40  proportion  would  place  that  pilot  at 
only  the  15%  level.  Either  score  should  be  of  clinical  concern. 

Table  9  presents  the  throughput  data.  Here,  higher  scores 
represent  very  fast,  accurate,  and  efficient  cognitive 
processes .  Low  scores  represent  poor  performance .  A  throughput 
of  0  3  on  the  MATH  task  would  represent  a  performance  at  the 
first  percentile  of  the  distribution.  This  would  suggest  a 
impairment  relative  to  the  norms . 
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Finally,  Table  10  presents  the  distributions  for  the  process 
variables.  The  table's  footnote  indicates  the  direction  of 
performance  and  the  tails  of  clinical  concern.  Here,  again,  a 
number  of  the  variables  had  highly  skewed  distributions  with 
limited  variance  and  only  a  limited  number  of  distribution  points 
could  be  mapped. 

Pattern  of  Performance  Method 

Table  11  provides  the  statistically  expected  differences  in 
scaled  scores  across  tests  given  to  a  single  subject  at  a  single 
point  in  time.  The  MAB  is  used  here  because  the  variables  are 
widely  used  and  understood.  The  CogScreen  is  not  presented 
because  no  theory  or  research  exists  on  its  interscale  behavior 
in  impaired  individuals. 

The  approach  here  is  that  variables  such  as  Vocabulary  and 
Information  are  relatively  resistant  to  cognitive  insult. _  The 
performance  tasks  (Digit  Symbol,  Picture  Conlpletion,  Spatial, 
Picture  Arrangement,  and  Object  Assembly)  are  far  more  likely  to 
be  affected  by  an  impairing  incident.  Difference  scores, 
however,  will  naturally  vary  quite  widely  in  non- impaired 
individuals  and  must  be  modeled. 

To  develop  this  data,  the  scaled  scores  for  each  of  the 
performance  tasks  was  subtracted  from  the  scaled  score  of 
Vocabulary  and  Information.  This  resulted  in  a  distribution  of 
difference  scores  for  the  sample.  The  means  and  standard 
deviations  are  presented  in  Table  11.  On  average,  pilots  have 
better  performance  scores  than  Vocabulary  scores  as  evidenced  by 
the  negative  difference  scores.  Their  scores  on  Information  are 
more  similar  to,  and  slightly  better  than,  their  scores  on  the 
performance  tasks  with  difference  scores  of  generally  1  to  3 
points. 

The  data  of  interest  are  those  differences  which  are 
positive  and  large.  This  would  clinically  suggest  that 
performance  type  ability  is  well  below  the  traditional  "hold 
verbal  tests.  The  "hold"  tests  would  have  "held"  and  the  "don't 
hold"  tests  would  have  "not  held" .  The  bottom  line  of  a  positive 
and  large  score  would  be  a  cognitive  impairment. 

If  a  patient  had  a  scaled  Vocabulary  score  of  60  and  a  Digit 
Symbol  score  of  45,  the  difference  would  be  15  points.  Looking 
at  the  table,  a  15  point  difference  would  place  this  patient  well 
above  the  99th  percentile.  A  clinician  could  be  99%  certain  that 
such  scores  would  not  be  found  in  non-impaired  pilots. 

It  is  recommended  that  the  Scaled  Vocabulary  score  minus  the 
Scaled  Digit  Symbol  score  be  used  for  most  purposes.  Vocabulary 
seems  to  behave  best  in  this  population  and  appears  to  have  the 
most  stable  norming  across  studies  using  the  MAB.  Digit  Symbol 
is  a  complex,  heterogeneous  task  which  is  sensitive  to  many 
functional  declines.  The  raw  score  difference  scores  are 
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unstable  due  to  the  lack  of  a  common  underlying  metric  and  are 
provided  here  for  reference  only. 

DISCUSSION 

The  accurate  assessment  of  the  cognitive  functioning  of 
pilots  is  essential.  The  lives  and  careers  of  pilots  and  the 
lives  of  crews  and  passengers  may  depend  upon  it.  The  USAF  also 
is  interested  in  increasing  mission  effectiveness,  reducing 
training  costs,  and  managing  retention. 

The  USAF  EFS  program  provides  an  opportunity  to  collect 
large  sets  of  cognitive  data  on  pilot  candidates  (Callister, 

King,  and  Retzlaff ,  1995) .  No  other  study  or  function  has 
ever  allowed  for  such  large  samples  or  for  the  archiving  of 
individual  data . 

Three  clinical  methods  for  the  neuropsychological  assessment 
of  pilots  have  been  delineated.  A  method  using  pre-morbid  test 
data  for  those  pilots  with  archived  EFS  data  has  been  explored. 
Additionally,  two  methods  have  been  explained  for  the  testing  of 
pilots  without  pre-morbid  testing  available.  The  necessary 
statistical  tables  are  presented  for  clinical  use. 

A  number  of  caveats  must  be  mentioned.  First,  these  data 
are  from  pilot  candidates.  As  such  there  is  some  chance  that  the 
data  are  not  as  precise  as  they  might  be.  A  number  of  studies, 
however,  have  found  very  similar  intelligence  test  data.  Also 
Retzlaff,  King,  and  Callister  (1995b)  _ found  no  differences  in 
intelligence  between  those  entering  pilot  training  and  those 
finishing.  The  CogScreen  is  less  well  known  and  larger 
differences  may  operate. 

It  would  also  have  been  better  to  use  stability  coefficients 
which  had  been  calculated  from  an  Air  Force  pilot  sample.  The 
use  of  general  stability  coefficients  from  the  test  manuals  are 
within  the  normal  range  of  practice,  but  a  one  year  test-retest 
study  of  a  group  of  mid- career  pilots  would  have  provided  much 
more  specific  statistics. 

Finally,  it  is  important  to  note  that  this  is  a  relatively 
atypical  approach  to  neuropsychology  driven  by  the  unique  needs 
of  the  USAF  medical  baselining  requirements.  Psychology  has  a 
long  history  of  neuropsychological  tests,  assessment,  and 
methods.  Traditional  neuropsychological  assessment  includes  many 
tests  across  many  hours  of  individualized  testing.  It  is  fully 
expected  that  the  current  work  will  be  in  addition  to,  not  in 
place  of,  the  traditional  techniques. 
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Table  1 

Means  and  standard  deviations  for  all  MAB  variables . 


Variable 

Mean 

SD 

r 

SEE 

95% 

10  Scores 

Full  Scale 

119.3 

7.0 

.97 

1.21 

2.38 

Verbal 

118.1 

7.0 

.95 

1.57 

3.07 

Performance 

118.0 

8.8 

.96 

1.76 

3.45 

Scaled  Scores 

Information 

67.4 

6.8 

.97 

1.18 

2.31 

Comprehension 

60.1 

4.1 

.95 

0.92 

1.80 

Arithmetic 

62.3 

6.5 

.88 

2.25 

4.41 

Similarities 

62.1 

4.8 

.83 

1.98 

3.88 

Vocabulary 

58.4 

6.4 

.90 

2.02 

3.97 

Digit  Symbol 

66.4 

6.8 

.90 

2 . 15 

4.21 

Picture  Completion 

63.7 

6.8 

.94 

1.67 

3.26 

Spatial 

63.5 

7.3 

.93 

1.93 

3 . 79 

Picture  Arrangement  60.1 

7.2 

.87 

2.60 

5 . 09 

Object  Assembly 

64.5 

7.5 

.93 

1.98 

3 . 89 

Raw  Scores 

Information 

29.4 

4.5 

.97 

0.78 

1.53 

Comprehension 

23.4 

2.2 

.95 

0.49 

0.96 

Arithmetic 

15.7 

2.0 

.88 

0 . 69 

1.36 

Similarities 

27.8 

3.0 

.83 

1.24 

2.42 

Vocabulary 

29.2 

5.7 

.90 

1.80 

3.53 

Digit  Symbol 

29.2 

3.4 

.90 

1.08 

2.11 

Picture  Completion 

26.9 

3.7 

.94 

0 . 91 

1 .78 

Spatial 

36.8 

6.8 

.93 

1.80 

3.53 

Picture  Arrangement  12.6 

2.1 

.87 

0 . 76 

1.48 

Object  Assembly 

15.7 

3.1 

.93 

0.82 

1 . 61 

Note:  N=537,  r  is  the  stability  (reliability)  taken  from  the 
MAB  manual,  SEE  is  the  Standard  Error  of  Estimate,  and 
95%  is  the  95%  confidence  interval. 
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Table  2 


Means  and  standard  deviations  for  CogScreen  Speed  variables . 


Variable 


4  MATHRTC 
7  VSCRTC 
14  MTSRTC 
17  MANRTC 
19  DATIRTC 
21  DATDRTC 
23  DATSCRTC 
27  ASCRTC 
30  PFNRTC 
34  PFLRTC 
38  PFCRTC 
42  SATADRTC 
45  SATACRTC 
48  SATINRTC 
51  SATDIRTC 
57  DTTAABS 
59  DTTDABS 
61  DTTPARTC 
64  DTTDRTC 


Mean 

SD 

r 

SEE 

95% 

27.25 

8.79 

.77 

4.22 

8.26 

2.24 

.51 

.89 

.17 

.33 

1.47 

.28 

.79 

.13 

.25 

1.98 

.38 

.85 

.15 

.29 

.40 

.07 

.68 

.04 

.08 

.69 

.20 

.63 

.12 

.24 

2.15 

.53 

.84 

.21 

.42 

.98 

.24 

.75 

.12 

.24 

.85 

.16 

.77 

.08 

.15 

.79 

.13 

.75 

.07 

.13 

1.20 

.30 

.80 

.13 

.26 

.70 

.10 

.80 

.04 

.09 

.68 

.09 

.72 

.05 

.09 

.86 

.15 

.91 

.05 

.09 

.95 

.21 

.76 

.10 

.20 

24.12 

19.50 

.65 

11.54 

22.61 

49.42 

26.06 

.85 

10.09 

19.78 

.48 

.19 

.76 

.09 

.18 

.66 

.24 

.72 

.  13 

.25 

Note:  N=512,  r  is  the  stability  (reliability)  taken  from  the 
CogScreen  manual,  SEE  is  the  Standard  Error  of  Estimate,  and 
95%  is  the  95%  confidence  interval.  All  scores  are  in  seconds 
except  DTTAABS  and  DTTDABS  which  are  distance  measures  in  fixed 
seconds.  Appendix  A  provides  full  variable  definitions. 
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Table  3 


Means  and  standard  deviations  for  CogScreen  Accuracy  variables. 
Variable 


Mean  SD 


2  BBS ACC 

3  MATHACC 
6  VSCACC 
9  SDCACC 

11  SDCIRACC 

12  SDCDRACC 

13  MTS ACC 
16  MANACC 
24  DATSCACC 
26  ASCACC 
29  PFNACC 
33  PFLACC 
37  PFCACC 
41  SATADACC 
44  SATACACC 
47  SATINACC 
50  SATDIACC 
60  DTTPAACC 
63  DTTPDACC 


.89 

.12 

.72 

.19 

.97 

.03 

.99 

.01 

.94 

.13 

.93 

.15 

.95 

.05 

.93 

.09 

.89 

.07 

.90 

.10 

.99 

.01 

.99 

.01 

.98 

.03 

.98 

.03 

.99 

.03 

.97 

.03 

.67 

.11 

.93 

.07 

.86 

.11 
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Table  4 


Means  and  standard  deviations  for  CogScreen  Throughput  variables. 


Variable 


Mean 

SD 

5 

MATHPUT 

1.82 

1.22 

8 

VSCPUT 

27.56 

6.20 

10 

SDCPUT 

33.74 

6.00 

15 

MTS PUT 

40.44 

7.73 

18 

MANPUT 

29.51 

7.05 

25 

DATSCPUT 

26.32 

6.47 

28 

ASCPUT 

58.79 

17.48 

31 

PFNPUT 

72.00 

12.86 

35 

PFLPUT 

77.46 

12.24 

39 

PFCPUT 

51.83 

12.54 

43 

SATADPUT 

86.55 

12.77 

46 

SATACPUT 

88.51 

11.20 

49 

SATINPUT 

69.59 

11.64 

52 

SATDIPUT 

44.64 

11.68 

62 

DTTPAPUT 

131.25 

46.15 

65 

DTTPDPUT 

90.85 

38.48 
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Table  5 


Means  and  Standard  Deviations  for  CogScreen  Process  variables. 


Variable 


Mean  SD 


20  DATIPRE 
22  DATDPRE 
32  PFNCOOR 
36  PFLCOOR 
40  PFCCOOR 

53  SATDIRUL 

54  SATDIFAI 

55  SATDIPER 

56  SATDINON 

58  DTTAHIT 

59  DTTDHIT 


2  . 

.52 

1. 

.80 

2  , 

,22 

2  . 

.04 

0. 

.80 

0  . 

.33 

0  , 

.95 

0, 

.35 

0  . 

.87 

0  , 

.31 

6  , 

.96 

2  , 

.50 

2  , 

.15 

1, 

.92 

1, 

.89 

2 

.51 

1 

.57 

2 

.71 

0 

.92 

1 

.95 

3 

.49 

3 

.39 
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Table  6 


Percentiles  for  MAB  variables. 


Variable 

1% 

5% 

15% 

50% 

85% 

95% 

99% 

10  Scores 

Full  Scale  100 

107 

112 

119 

126 

129 

133 

Verbal 

99 

106 

110 

118 

125 

129 

132 

Performance 

93 

101 

109 

118 

126 

131 

135 

Scaled  Scores 
Information 

47 

53 

59 

68 

73 

76 

79 

Comprehension 

49 

52 

56 

60 

64 

65 

67 

Arithmetic 

45 

51 

54 

60 

67 

70 

79 

Similarities 

46 

53 

56 

62 

66 

69 

70 

Vocabulary 

43 

47 

52 

58 

65 

68 

73 

Digit  Symbol 

48 

54 

58 

66 

72 

76 

78 

Picture  Completion 

45 

51 

56 

64 

69 

73 

75 

Spatial 

37 

52 

57 

63 

70 

74 

77 

Picture  Arrangement 

42 

48 

52 

59 

65 

72 

76 

Object  Assembly 

40 

49 

56 

65 

70 

72 

74 

Raw  Scores 

Information 

16 

20 

24 

30 

33 

35 

37 

Comprehension 

17 

19 

21 

23 

25 

26 

27 

Arithmetic 

10 

12 

13 

15 

17 

18 

21 

Similarities 

18 

22 

24 

28 

30 

32 

33 

Vocabulary 

15 

19 

23 

29 

35 

38 

42 

Digit  Symbol 

20 

22 

25 

29 

32 

34 

35 

Picture  Completion 

17 

20 

23 

27 

30 

32 

33 

Spatial 

12 

26 

31 

36 

43 

47 

49 

Picture  Arrangement 

7 

9 

10 

12 

14 

16 

17 

Object  Assembly 

5 

9 

12 

16 

18 

19 

20 

20 


Table  7 


Percentiles  for  Cogscreen  Speed  variables. 


Variable  1%  5% 


4  MATHRTC  6.62  15.00 

7  VSCRTC  1.30  1.48 

14  MTSRTC  0.92  1.08 

17  MANRTC  1.26  1.41 

19  DATIRTC  0.25  0.28 

21  DATDRTC  0.36  0.44 

23  DATSCRTC  1.36  1.48 

27  ASCRTC  0.43  0.62 

30  PFNRTC  0.55  0.63 

34  PFLRTC  0.55  0.61 

38  PFCRTC  0.66  0.82 

42  SATADRTC  0.51  0.56 

45  SATACRTC  0.51  0.55 

48  SATINRTC  0.61  0.67 

51  SATDIRTC  0.61  0.71 

57  DTTAABS  3.27  4.00 

59  DTTDABS  9.44  12.93 

61  DTTPARTC  0.21  0.28 

64  DTTDRTC  0.25  0.34 


15%  50%  85%  95% 


18. 

39 

26. 

,20 

36. 

07 

44  . 

01 

1. 

,75 

2. 

,18 

2. 

,67 

3  . 

,18 

1. 

,20 

1. 

,44 

1. 

.72 

1. 

,92 

1. 

,57 

1. 

.95 

2. 

.39 

2  , 

.68 

0, 

.33 

0. 

.40 

0. 

.46 

0. 

.52 

0. 

.52 

0, 

.66 

0. 

.87 

1 , 

.09 

1. 

.63 

2. 

.05 

2  . 

,66 

3. 

.15 

0. 

.74 

0. 

.97 

1. 

.22 

1. 

.39 

0  . 

.70 

0. 

.83 

1. 

.01 

1. 

.16 

0. 

.66 

0. 

.77 

0. 

.91 

1 . 

.04 

0, 

.92 

1, 

.15 

1, 

.48 

1, 

.75 

0, 

.60 

0 

.68 

0 

.80 

0 

.90 

0 

.59 

0 

.67 

0 

.77 

0 

.85 

0 

.71 

0 

.84 

1 

.01 

1 

.16 

0 

.77 

0 

.92 

1 

.12 

1 

.31 

6 

.13 

18 

.17 

43 

.33 

65 

.75 

18 

.32 

47 

.37 

79 

.75 

94 

.  12 

0 

.32 

0 

.43 

0 

.68 

0 

.87 

0 

.43 

0 

.61 

0 

.88 

1 

.  13 

99% 


49.71 

3.91 

2.21 

2.99 

0.59 

1.34 

3.76 
1.73 
1.34 
1.15 
2.20 
1.02 
0.98 
1.31 

1 . 77 
86.56 

104.28 

1.13 

1.44 
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Table  8 


Percentiles  for  Cogscreen  Accuracy  variables. 


Variable  5% 


2 

BDSACC 

.50 

.67 

3 

MATHACC 

.20 

.40 

6 

VSCACC 

.85 

.90 

9 

SDCACC 

.95 

.97 

11 

SDCIRACC 

.50 

.83 

12 

SDCDRACC 

.33 

.50 

13 

MTSACC 

.80 

.85 

16 

MANACC 

.70 

.80 

24 

DATSCACC 

.74 

.80 

26 

ASCACC 

.60 

.70 

29 

PFNACC 

.96 

n/a 

33 

PFLACC 

.92 

.96 

37 

PFCACC 

.88 

.92 

41 

SATADACC 

.83 

.92 

44 

SATACACC 

.83 

.92 

47 

SATINACC 

.88 

.91 

50 

SATDIACC 

.42 

.55 

60 

DTTPAACC 

.77 

.86 

63 

DTTPDACC 

.61 

.74 

Note :  Performance  in  general  was  so  high  on  the  PFNACC 
task  that  there  was  insufficient  variability  to  derive 
a  15%  cutscore. 
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Table  9 


Percentiles  for  Cogscreen  Throughput  variables. 


Variable  1% 


5 

MATHPUT 

0.3 

8 

VSCPUT 

14 

10 

SDCPUT 

22 

15 

MTS PUT 

23 

18 

MANPUT 

14 

25 

DATSCPUT 

13 

28 

ASCPUT 

25 

31 

PFNPUT 

44 

35 

PFLPUT 

51 

39 

PFCPUT 

25 

43 

SATADPUT 

57 

46 

SATACPUT 

59 

49 

SATINPUT 

42 

52 

SATDIPUT 

16 

62 

DTTPAPUT 

36 

65 

DTTPDPUT 

20 

5% 

15% 

50% 

0.7 

1.0 

1.7 

19 

22 

27 

25 

28 

33 

29 

33 

40 

18 

22 

29 

16 

20 

26 

35 

44 

57 

51 

59 

71 

57 

66 

77 

33 

40 

51 

64 

73 

88 

69 

77 

89 

50 

58 

69 

21 

32 

45 

56 

77 

131 

37 

53 

86 

85%  95%  99% 


2.5 

3.2 

5 . 8 

34 

40 

45 

39 

45 

51 

46 

53 

62 

37 

41 

46 

33 

37 

41 

73 

90 

121 

85 

95 

105 

91 

97 

109 

64 

72 

84 

99 

105 

115 

99 

106 

112 

82 

87 

97 

56 

62 

67 

175 

202 

251 

126 

157 

216 
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Table  10 


Percentiles  for  Cogscreen  Process  variables . 


Variable 


32  PFNCOOR 
36  PFLCOOR 
40  PFCCOOR 

53  SATDIRUL 
20  DATIPRE 
22  DATDPRE 

54  SATDIFAI 

55  SATDIPER 

56  SATDINON 

58  DTTAHIT 

59  DTTDHIT 


5% 

15% 

50% 

85% 

95% 

0.20 

0.40 

0.80 

1.10 

1.30 

0.30 

0.60 

0 . 90 

1.30 

1.50 

0.40 

0.50 

0.80 

1.10 

1.50 

0 

4 

4 

6 

4 

6 

4 

6 

4 

6 

3 

7 

2 

5 

8 

10 

N  512 

Note:  Only  the  Coordination  variables  had  sufficient  range 
and  resolution  to  allow  percentiles  across  all  ranges.  Lower 
scores  on  SATDIRUL  indicates  poorer  performance.  Higher  scores  on 
all  other  variables  indicates  poorer  performance. 
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Table  11 


Percentiles  for  MAB  variable  difference  scores. 
Variable  Mean  SD  85%  95%  SB'S 

Scaled  Vocabulary  minus  Scaled: 


Digit  Symbol 

-8.0 

8.6 

0 

6 

11 

Picture  Completion 

-5.3 

7.7 

2 

8 

15 

Spatial 

-5.1 

8.9 

3 

9 

19 

Picture  Arrangement 

-2.4 

8.9 

6 

12 

17 

Object  Assembly 

-6.1 

8.9 

2 

7 

17 

Scaled  Information  minus  Scaled 

: 

Digit  Symbol 

0.9 

9.1 

10 

15 

22 

Picture  Completion 

3.7 

7 . 9 

11 

17 

23 

Spatial 

3.4 

8.8 

12 

18 

26 

Picture  Arrangement 

6.6 

8.6 

15 

19 

25 

Object  Assembly 

2.9 

9.0 

11 

17 

22 

Raw  Vocabulary  minus  Raw: 

Digit  Symbol 

-0.1 

6.2 

6 

10 

14 

Picture  Completion 

2.2 

5.7 

8 

11 

16 

Spatial 

-7.7 

8.1 

0 

5 

14 

Picture  Arrangement 

16.5 

5.8 

22 

25 

30 

Object  Assembly 

13.4 

6.0 

19 

23 

2  6 

Raw  Information  minus  Raw: 


Digit  Symbol 
Picture  Completion 
Spatial 

Picture  Arrangement 
Object  Assembly 


0.2 

5.4 

5 

8 

12 

2.5 

4 . 8 

7 

10 

14 

-7.4 

7.2 

-1 

4 

13 

16.8 

4.5 

21 

23 

25 

13 . 7 

4.9 

18 

21 

23 

25 


Appendix  A 

CoaScreen  Variable  Definitions 


1  BDSACC 

Accuracy 

Math 

2  MATHACC 

Accuracy 

3  MATHRTC 

Speed 

4  MATHPUT 

Thruput 

Visual  Sequence  Comoarison 

5  VSCACC 

Accuracy 

6  VSCRTC 

Speed 

7  VSCPUT 

Thruput 

Symbol  Diait 

Codina 

8  SDCACC 

Accuracy 

9  SDCPUT 

Thruput 

10  SDCIRACC 

Immediate  Recall  Accuracy 

11  SDCDRACC 

Delayed  Recall  Accuracy 

Mat china  to 

Samole 

12  MTS ACC 

13  MTSRTC 

14  MTS PUT 


Accuracy- 

Speed 

Thruput 


Manikin  Test 

15  MANACC 

16  MANRTC 

17  MANPUT 


Accuracy 

Speed 

Thruput 


Divided  Attention  Test 

18  DATIRTC 

19  DATIPRE 

20  DATDRTC 

21  DATDPRE 

22  DATSCACC 

23  DATSCRTC 

24  DATSCPUT 


Indicator  alone  speed 

Indicator  alone  premature  response 

Indicator  dual  speed 

Indicator  dual  premature  response 

Sequence  comparison  accuracy 

Sequence  comparison  speed 

Sequence  comparison  thruput 


Auditory 

25  ASCACC 

26  ASCRTC 

27  ASCPUT 


Sequence  Comparison 
Accuracy 
Speed 
Thruput 
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Pathfinder 

28  PFNACC  Number  accuracy 

29  PFNRTC  Number  speed 

30  PFNPUT  Number  thruput 

31  PFNCOOR  Number  coordination 

32  PFLACC  Letter  accuracy 

33  PFLRTC  Letter  speed 

34  PFLPUT  Letter  thruput 

35  PFLCOOR  Letter  coordination 

36  PFCACC  Combined  accuracy 

37  PFCRTC  Combined  speed 

38  PFCPUT  Combined  thruput 

39  PFCCOOR  Combined  coordination 


Shifting  Attent 

40  SATADACC 

41  SATADRTC 

42  SATADPUT 

43  SATACACC 

44  SATACRTC 

45  SATACPUT 

46  SATINACC 

47  SATINRTC 

48  SATINPUT 

49  SATDIACC 

50  SATDIRTC 

51  SATDIPUT 

52  SATDIRUL 

53  SATDIFAI 

54  SATDIPER 

55  SATDINON 


■  ion  Test 

Arrow  direction  accuracy 
Arrow  direction  speed 
Arrow  direction  thruput 
Arrow  color  accuracy 
Arrow  color  speed 
Arrow  color  thruput 
Instruction  accuracy 
Instruction  speed 
Instruction  thruput 
Discovery  accuracy 
Discovery  speed 
Discovery  thruput 
Discovery  rule  shifts  completed 
Discovery  failed  set 
Discovery  perseveration  errors 
Discovery  nonconcept  response 


Dual  Task  Test 

56  DTTAABS  Tracking  alone  error 

57  DTTAHIT  Tracking  alone  boundary  hits 

58  DTTDABS  Tracking  dual  error 

59  DTTDHIT  Tracking  dual  boundary  hits 

60  DTTPAACC  Previous  number  alone  accuracy 

61  DTTPARTC  Previous  number  alone  speed 

62  DTTPAPUT  Previous  number  alone  thruput 

63  DTTPDACC  Previous  number  dual  accuracy 

64  DTTDRTC  Previous  number  dual  speed 

65  DTTPDPUT  Previous  number  dual  thruput 
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